AITopics | debugging test

Collaborating Authors

debugging test

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Debugging Tests for Model Explanations

Neural Information Processing SystemsDec-23-2025, 17:36:27 GMT

We investigate whether post-hoc model explanations are effective for diagnosing model errors--model debugging. In response to the challenge of explaining a model's prediction, a vast array of explanation methods have been proposed. Despite increasing use, it is unclear if they are effective. To start, we categorize \textit{bugs}, based on their source, into: ~\textit{data, model, and test-time} contamination bugs. For several explanation methods, we assess their ability to: detect spurious correlation artifacts (data contamination), diagnose mislabeled training examples (data contamination), differentiate between a (partially) re-initialized model and a trained one (model contamination), and detect out-of-distribution inputs (test-time contamination). We find that the methods tested are able to diagnose a spurious background bug, but not conclusively identify mislabeled training examples. In addition, a class of methods, that modify the back-propagation algorithm are invariant to the higher layer parameters of a deep network; hence, ineffective for diagnosing model contamination. We complement our analysis with a human subject study, and find that subjects fail to identify defective models using attributions, but instead rely, primarily, on model predictions. Taken together, our results provide guidance for practitioners and researchers turning to explanations as tools for model debugging.

contamination, debugging test, name change, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback

Review for NeurIPS paper: Debugging Tests for Model Explanations

Neural Information Processing SystemsJan-21-2025, 07:56:16 GMT

Weaknesses: Although I think the paper looked into an important question, I feel like the negative results from the user study largely confirm known issues of the attribution methods and previous results on evaluating interpretation methods. For example, the observation that in a cooperative setting, humans largely rely on model prediction while ignoring explanations is described in many HCI papers including but not limited to "On human predictions with explanations and predictions of machine learning models: A case study on deception detection" by Lai & Tan (FAT* 2019). Many of the empirical assessments are also done in previous papers. I'm having a hard time figuring out what new value this paper provides. The authors consider the bug categorization one of the contributions.

debugging test, model explanation, neurips paper, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Debugging Tests for Model Explanations

Neural Information Processing SystemsOct-9-2024, 11:06:22 GMT

contamination, debugging test, model explanation, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.71)

Add feedback